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CRF Errors CorroctedTjyUToTIC Systems Branch ' " *Z / j J 

Se'rial Number: /a/cM.33g ^ T^^-^ Wi 

| — | Changed a file from non-ASCSI to A$G\r* ly >W Verified by: f Z (STIC staff)- 

| ] Changed the margins in cases where the sequence ISkf^p^^ypped" down to the next line. 

[~] Edited a format error in the Current^Application Data section, specifically: 

^. ________ _ 

| | Edited the Current Application Data section with the actual current number. The number inputted by the 
applicant was Q the prior application data; or □ other 

rn Added the mandatory heading and subheadings for 'Current Application Data". . 

[~ j Edited the "Number of Sequences" field. The applicant spelled out a number instead of using ^in tiger, 




§ ^ O 

| } Changed the spelling of a mandatory field (the headings or subheadings), specifically: ^ 

J J co -rs 

| | Corrected the SEQ ID NO when obviously incorrect. The sequence numbers that were edited w@: ^ ffl 

2 1— O 

I I Inserted or corrected a nucleic number at the end of a nucleic line. SEQ ID NCs edited: 

| [ Corrected subheading placement. All responses must be on the same line as each subheading. If the 
applicant placed a response below the subheading, this was moved to its appropriate place. 

| [ Inserted colons after headings/subheadings. Headings edited included: . 

| I Deleted extra, invalid, headings used by an applicant, specifically: 

L ™ 1 / 



Deleted: Q/hon-ASCII "garbage" at the beginning/end of files; O secretary initials/filename at end of file; 
□ page numbers throughout text; □ other invalid text such as . 

| | Inserted mandatory headings, specifically: 

I I Corrected an obvious error in the response, specifically: 

I | Edited idgotifierjs where upper case is used but lower case is required, or vice versa. 

| 1 Corrected an error in the Number of Sequences field, specifically: 

1 1 A "Hard Page Break" code was inserted by the applicant All occurrences had to be deleted 

1 | Deleted ending stop codon in amino acid sequences and adjusted the "(A)Length:" field accordingly (error 
due to a Patentin bug). Sequences corrected: , 

□ Other ^ 



'Examiner: Th abov corrections must b communicated to th applicant In the first Office 
Action, DO NOT send a copy of this form. 3/1/95 
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SEQUENCE LISTING 

1 (1) GENERAL INFORMATION : 

2 (i) APPLICANT: SmithKline Beecham pic et al 

3 (ii) TITLE OF INVENTION: Novel compounds 

4 (iii) NUMBER OF SEQUENCES: 19 

5 (iv) CORRESPONDENCE ADDRESS: 

6 (A) ADDRESSEE: SmithKline Beecham 

7 (B) STREET: Two, New Horizons Court, Great West Road 

8 (C) CITY: Brentford 

9 (D) STATE: ^ 

10 (E) COUNTRY: UK TJJ 

11 (F) ZIP: TW8 9EP J ^ 

12 (V) COMPUTER READABLE FORM: C"> ^ 

13 (A) MEDIUM TYPE: Diskette *S ^° 

14 (B) COMPUTER: IBM Compatible ^ V-* 

15 (C) OPERATING SYSTEM: DOS 33 GO 

16 (D) SOFTWARE: FastSEQ for Windows Version 2.0 £o S 

17 (vi) CURRENT APPLICATION DATA: § Q 
C--> 18 (A) APPLICATION NUMBER: US/10/071 f 338 ro 
C--> 19 (B) FILING DATE: 08-Feb-2002 g 

20 (C) CLASSIFICATION: 

21 (Vii) PRIOR APPLICATION DATA: 

22 (A) APPLICATION NUMBER: 

23 (B) FILING DATE: 

24 (viii) ATTORNEY/AGENT INFORMATION: 

25 (A) NAME: Valentine, Jill B 

26 (B) REGISTRATION NUMBER: 

27 (C) REFERENCE/DOCKET NUMBER: P31731 

28 (ix) TELECOMMUNICATION INFORMATION: 

29 (A) TELEPHONE: 0181-9752000 

30 (B) TELEFAX: 0181-9756294 

31 (C) TELEX: 

32 (2) INFORMATION FOR SEQ ID NO: 1: 

33 (i) SEQUENCE CHARACTERISTICS: 

34 (A) LENGTH: 7193 base pairs 

35 (B) TYPE: nucleic acid 

36 (C) STRANDEDNESS : single 

37 (D) TOPOLOGY: linear 

38 (ii) MOLECULE TYPE: Other 

39 (xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1: 

40 CCATGGCGGG CGGCGGCTGC CCCGGAGCCT CGGCCGGACC GGTGACCAGG ACCACCCCGG 

41 TGGGATAGTG GCCCGCCACC CGGCGCAGCA GACTCCCGGA CACGGACCCG TGGGTGTGCG 

42 CGGAAAGGCC CGGAGGCCGG GTCACAGCCA CGGGTAACGC GCGGTGTCCT TGCCCGCGTA 
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43 ATCGGGGTCC AGATAGACGA AGGCCCGGTG GACGAGGAAG TCCCGCACCT CGTAGACCGT 240 

44 GCACCAGCGC CCGGCGGCCC ACTCGGGGTC ACCCGCCCGC CACGGCCCGT CCCGGTGCTC 300 

45 ACCGTGGGTG GTGCCCTCCG CGGCGAGGAG TTCGGTCCCG GTCAGAATCC AGTTGACGGA 360 

46 CCACAGATGG TGGGTGATCG AGCGGATGGT GCCCCCGAGG TCGTCGAAGA GCCGGGCGAT 420 

47 CTCGGACTTG CCCCGGGCCA GACCCCACTT GGGGAAGAAG AAGACCGCGT CCTCGGCGAA 480 

48 GTAGTCGATC GCGGGGGTGC CGTCGCTGCC GACGCCGCCG TTGTCGAACG CCTTGAAGTA 540 

49 CGCGGTGATG ACCGCCTTGC GCTGCTCGTC CGTCATACCG GCCGATGCCA CGGACATGAA 600 

50 ACGACCTCCA GAGATTCCGG GTGGCTGTGC TGGGGCTGCG GAAGGGGTGT CCCCCGCGAA 660 

51 GGACGGCGGA CGCCGCGGAC GCCGCGGCCG TCTCCCCGGC GGACGGGTCC CAGCGTCCTG 720 

52 GAGAGGGCTT GGCGGCGGCT TGACGCCGTG CTGTCCCGCG GCTTGCGGAA CGCGAAGTAC 780 

53 CGGCCAGCGT ACGGGCGTTG CACCGGACGT GTACGCCGGT CGGGACCCCT CGTACCCCCG 840 

54 GAGCCGGCCG ACCCCGGCGG CTCCGGGGGT ACGGACGCGC CGGACCGGCC CGAGCGAGCC 900 

55 GGACGGGTCG GACGGTGCGC GTGGTTCCGG TGTGTCGGAC AGCTCGGACG GACCGGACGG 960 

56 TGCGCGTGGT TCCGGTGTGT CGGACAGCTC GGACGGGTCG GACGGTGCGC GTGGTTCCGG 1020 

57 CACGCCGGAC GGGTCAGTTG CCGATCATGG CGAGCAATGC CGGGGTGTAC CGCTCCCCGG 1080 

58 ACACCGGGTG GGAGATCGCG GCCGTCACCT CCGCGAGGGA CCGGTCGTCC AGCCGGATCG 1140 

59 AGGCGGCGGC GAGATTGTCC GCGAGATGGG CCGGGTTCGC GGTGCCCGGG ATCGGGACGA 1200 

60 CGTCCTCGCC CCGGTGGTGC AGCCAGGCGA GCGCGAGCTG TGCCAGGGTC AGCCCCAGAC 1260 

61 CGTCCGCGAC CGGGCGCAGC CGGTGCAGCA ACGAGCGGTT GCGCGCGAGG GCCGGAGCGC 1320 

62 TGAACCGGGG CTGGCCCCGG CGGAAGTCCT CGTCCCCCAG ATCGTCGGTG GTGCGGATGG 13 80 

63 TGCCGGTGAG AAAACCCCGT CCCAGAGGGG CGTAAGCGAC GATCCCGATC CCCAGCTCCC 1440 

64 GGCAGACGGG CACCACCTCG TCCTCGATCC CGCGCGACCA CAGGCTCCAC TCGCTCTGCA 1500 

65 CCGCCGTCAC CGGGTGCACC GCGTCCGCCC GGCGCAGCGT GGCCGCGGAG GGCTCGGAGA 1560 

66 GACCGAGCCT GCGGACCTTG CCCTCGCGCA CCAGCTCGGC CACCGCACCC ACGGTCTCCT 1620 

67 CGATCGGCAC CGCCGGGTCC GTCCAGTGCT GGTAGTACAG GTCGATGCGG TCGGTGCCGA 1680 

68 GACGACGCAG GGACCGTTCG CAGGCCGCGC GGACGTAGGA CGGCTCGCCG CACAAGCCCT 1740 

69 GGGAGGCGCC GTCGGACGAG CGCACCATGC CGAACTTGGT GGCGATCAGC ACCTCGTCCC 1800 

70 GGCGGCCCGC GACCGCCCGT CCGAGCAGCT CCTCACCGGC GCCGAGCCCC TGGACGTCGG 1860 

71 CGGTGTCCAG CAGGGTGACC CCGGCGTCGA CGGCGGCGCG GATGGTGGCC GTCGCCCGGG 1920 

72 CGCGGTCCGG GCGTCCGTAG AAGTCGGTGG TCGGCAGGCA GCCGAGCCCC TGGGCACTGA 1980 

73 CCGGAAGGTC CCGCAGGGCG CGGACCGGCG GACGCGGAAC CGCGGCGGAC ACGGAACCGG 2040 

74 CCGGGGACTC GGGCGGAGAG CGGGACATAC GGAACCTCCA CAGGCGGAGC CGGGAACGGG 2100 

75 ACGAGGGCGA GGACGGGACG GAACGAAGGA GAGGACGGGA CGGACAGCAC GGACGGGACG 2160 
7 6 GACGGAACGG AGTCGGGAAC CGGGGGGGGT GACCGGAACC GGGCCGTCCT TGGCCCTCCC 2220 
77 CCGTCCTCCC CGCCATCCGC CGTTCTCCCC CGTTCCCTCT CCCGTCCTCC AGCCAACACC 2280 
7 8 GCCGCCCTTT CCAAGCGCTT GACACGGCAC CGACAGCCGC CGCCGGGCGC CCGATGGGGA 2340 

79 CCCGTGCCCG CCGGTGAGCG GCGGTGAGCG CCGGTACGGG ACCCCACGCG CCGCCGCCCG 2400 

80 GGCGCCCGCC AGGGCCCGCG CGGCCACCCC GGCCCGCCCC GGCCGGAGCG GCGATCCGGG 2460 

81 CCGCTCGCTG CAAGAGGAAC ATCCACAGCC GCACAAGGAG CGCTCCGCAC AGTGGGCACC 2520 

82 ACGTCCGCCC CGTCCCCCAC ACCGTGGCCG GTCCCCACCG GACAGCACAG CACCGCACAG 2580 

83 CACCACATCG CACGGCACAG CACAGCACCA CCGGCACGAG GAACCAAGGA AAGGAACCAC 2640 

84 ACCACCATGA CCTCAGTGGA CTGCACCGCG TACGGCCCCG AGCTGCGCGC GCTCGCCGCC 2700 

85 CGGCTGCCCC GGACCCCCCG GGCCGACCTG TACGCCTTCC TGGACGCCGC GCACACAGCC 2760 

86 GCCGCCTCGC TCCCCGGCGC CCTCGCCACC GCGCTGGACA CCTTCAACGC CGAGGGCAGC 2820 

87 GAGGACGGCC ATCTGCTGCT GCGCGGCCTC CCGGTGGAGG CCGACGCCGA CCTCCCCACC 2880 

88 ACCCCGAGCA GCACCCCGGC GCCCGAGGAC CGCTCCCTGC TGACCATGGA GGCCATGCTC 2940 

89 GGACTGGTGG GCCGCCGGCT CGGTCTGCAC ACGGGGTACC GGGAGCTGCG CTCGGGCACG 3000 

90 GTCTACCACG ACGTGTACCC GTCGCCCGGC GCGCACCACC TGTCCTCGGA GACCTCCGAG 3060 

91 ACGCTGCTGG AGTTCCACAC GGAGATGGCC TACCACCGGC TCCAGCCGAA CTACGTCATG 3120 
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92 CTGGCCTGCT CCCGGGCCGA CCACGAGCGC ACGGCGGCCA CACTCGTCGC CTCGGTCCGC 3180 

93 AAGGCGCTGC CCCTGCTGGA CGAGAGGACC CGGGCCCGGC TCCTCGACCG GAGGATGCCC 3240 

94 TGCTGCGTGG ATGTGGCCTT CCGCGGCGGG GTGGACGACC CGGGCGCCAT CGCCCAGGTC 3300 

95 AAACCGCTCT ACGGGGACGC GGACGATCCC TTCCTCGGGT ACGACCGCGA GCTGCTGGCG 3360 

96 CCGGAGGACC CCGCGGACAA GGAGGCCGTC GCCGCCCTGT CCAAGGCGCT CGACGAGGTC 3420 

97 ACGGAGGCGG TGTATCTGGA GCCCGGCGAT CTGCTGATCG TCGACAACTT CCGCACCACG 3480 

98 CACGCGCGGA CGCCGTTCTC GCCCCGCTGG GACGGGAAGG ACCGCTGGCT GCACCGCGTC 3540 

99 TACATCCGCA CCGACCGCAA TGGACAGCTC TCCGGCGGCG AGCGCGCGGG CGACGTCGTC 3600 

100 GCCTTCACAC CGCGCGGCTG AGCTCCCGGG TCCGACACCG CGCGGCTGAA CCCACGGTCC 3660 

101 GGGGCCCACG GTCCGGCACC GCGCGGCTGA GCCCCCGGGT CCGGCAGCGG GCGGCTGAAC 3720 

102 CCCCGCCCCG GGCCACCGCC CGACCGCCCC CGCGCACCGG ACGCGCCCGC CTGTACGGCG 3780 

103 GTCCCGCCCG GGCCCGTACA CCTGAAGCGC CCGGCGGACC GCCGCCCCGC CGGGGGACGG 3840 

104 ACAGAGCCGG GTGCGGGAGG ACGTCCTCCC GCACCCGGCT CCCACCGTTC CGCACCGACC 3900 

105 GCACCCGACC GTGCCGCAGG CGCCACCGGC ACCGCACCGC CCGCGCCGGC AGCCACCACA 3960 

106 GGCGCCACGC CGCCCGCACG GTGCCCGCGC TGCTCAGCCC CCGTCCACCG GGCTGTCCAG 4 020 

107 CAGCCGCCGC AGCGCGCCCC CGATGAACTC CCGGTCGGCG GCCGACCCCC CGGACCCCGC 4080 

108 GAGATGCCCC CACACTCCCG GGATCACCTC CAGCGAGGCA TACGGCAGCA GATCGGCCAC 4140 

109 CCGCTTCTCG TCCTCGACGG CGAAACACAC GTCCAGGGCG CCCGGCAGCA CCACGGCCCG 4200 

110 CGCCGTGACG GAGGCCAGCG CCGCCTCGAC GCTCCCCCCG GCCCCGGGTG TCGCCCCCAC 4260 

111 ATCCGTGTTC TCCCAGGTGC GCACCATGGT GAGCAGATCC GCGGCGCCGG GCCCGGAGAG 4320 

112 GAAGACCTGC TCCCAGAAGC CGGTGAGGTA CTCCTCGCGG GTGGCGAAAC CCAGCTCCCG 4380 

113 GTGGGCACGG CGGGCCCAGA AGGAACGCGA GGTCCCCCAC CCGGCGAACA CCCGGCCCGC 4440 

114 CGCCTTCCGC CCCCGCTCCC CGGCGTCGGC GCTGAGCGCC GCGGCCAGAC CGGACAGCAG 4500 

115 GACCAGGCTG TGCGGGCTGC TCACCGGCGC CCCGCAGATC GGGGCGATCC GGCGCACCAT 4560 

116 CCCCGGATGC GACACGGCCC ACTGGTAGGC GTGGGCCGCG CCCATCGACC AGCCCGTGAC 4620 

117 CAGGGCCAGT TCCCGTACCC CCAGCTCCTC GGTGAGCAGC CGGTGCTGCG CCGCGACATT 4680 

118 GTCCTGCGGA GTGATCAGCG GAAAGCGGGA CCCCGACGGG TGGTTGCCGG GCGAGCTGGA 4740 

119 GACCCCGTTG CCGAAGAGTC CGGCGGTGAC GACGCAGTAC CGCCGGGTGT CCAGCGGCAG 4800 

120 CCCCGCACCG ATCAGCCAGT CGTACCCGGT GTGGTCCCGG CCGAAGAACG ACGGACAGAG 4860 

121 CACCACGTTC GTCCCGTCGG CGTTCGGCGT GCCGTACATG GCGTAACCGA TCCGGGCGTC 4920 

122 CCGCAGGACC TCCCCGTCCA GCAACGGCAG TTCGTCGATC TCGAATATGC GGCATTCCAC 4980 

123 CGCTGACCTC CTTGTTCGAT CCCCCCGGAC AACAGGTCGG TCGTGGCCGG AGACTCAGAG 5040 

124 CCAGTTGGGG GCGATCTCGG TGGCCCACAG CTCCAGGCTG CGCAGCTGGA CATCGTGCGG 5100 

125 GATCAGCCCG GAGTACTGGC ACTGGAGCAG ATACTCCGGA TCGTGCCGCT CCACCAGCTT 5160 

126 CTCGATCATG CGGTTGATGT CGTCCGGGGT GCCGACCCAC TCCAGCCCCC GGTCGACCAG 5220 

127 GGTCTTGTAG TCCGAGCCGA TCGGACCCGT CTCGCCGGTC GCGCGCAGCG CCTCGGTGAA 5280 

128 GCCCATGGGG CCGAACCAGT TCTCGAAGAT GAAGCCGCCG CCGCGGGACG CCCAGTGGTG 5340 

129 GGCCTCGCCG GAGTCCCGGG AGACCAGGAC GTCCTTCATC ACCCCGACCC GCTCGCCCCG 5400 

130 CCGCAGGGTG CCGTGGCCCG CCGCCTCGGC CTCCTCCCGG TAGATGTCCA TCAGCCGGGC 5460 

131 GACGATCTGG TCGTCGGTGT TCATCAGGAT CGGCACCACG CCCTCCCGGG CACAGAACCG 5520 

132 GAACGTGTCC TCACTGAAGC TGAACGGCTG GAAGACGGGC GGGTGGGGGC GCTGGTAGGG 5580 

133 CTTGGGCGCG ATGCCCACCT CGCGGATGAC GCCGTTCTCG TCGAGGCCCC GGCCGTAGCG 5640 

134 GCGCACCGCC TCGTAGGGGA ACTCCAGGTC CGGCACCGGG ATCGTCCACT GCTCCCCGGA 5700 

135 GTGGGTGAAC GTCTCGGTCG TCCACGCCTT CTTGATGATC TCCCAGTGCT CCTCGAAGAG 5760 

136 GGCACGATTG CGCCGGTCCC GCTCCCCGGC GTCGGACAGG GTGCCGCCGA CCCCGTACAC 5820 

137 CTGCCCCATG ATGTCGGCCC AGCGCTTCTG GAACCCGCGC GCGATCCCGA CGAAGGCGCG 5880 

138 GCCCCGGGTC ATGTGGTCGA GCATCGCCAG ATCCTCGGCC AGCCGCAGCG GATTGTGCAG 5940 

139 CGGCAGGACG TTGGCCATCT GGCCGACCCG GATGTGCCGG GTCTGCATGC CGAGGTAGAG 6000 

140 CCCCAGCATG ATCGGGTTGT TGGAGACCTC GAAACCCTCG GTGTGGAAGT GGTGCTCGGT 6060 
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141 GAAGGACAGT CCCCAGTAGC CGAGTTCGTC GGCCGCCTGC GCCTGCCGGG TGAGCTGCCG 6120 

142 GAGCATGTTC TGGTAGTTCT GCGGATTGAC CCCCGCCATA CCCCGCTGGA CCTGCGCATG 6180 

143 ACTGCCGACC GTTGGCAGAT AGAAGAGAAT GGACTTCACC CTGGCTCCTC CGGTTCGCGG 6240 

144 CGCCCTCCAT TGACGTGCGC CGAAAGCGGC TCGACCGTCC CACTCCGCCC TTGAGTTCCG 6300 

145 TCTGACGCCG CGCCAGTCGG CGGGCCGTCC GCCGGGGTGC CCGCCGGGGT CCGCACCCGC 6360 

146 CGGACGGCAC GGCGCGCACC GCGCGCGCGG CGCTTCGGGG CACCGGGCTC GACGGGGTGC 6420 

147 TCAGCGGGAC GTCCAACGGA AGGCAAGCCC CCGTACCCAG CCTGGTCAAG GCGCTCATCG 6480 

148 CCATTCCCTG AGGAGGTCCC GCCTTGACCA CAGCAATCTC CGCGCTCCCG ACCGTGCCCG 654 0 

149 GCTCCGGACT CGAAGCACTG GACCGTGCCA CCCTCATCCA CCCCACCCTC TCCGGAAACA 6600 

150 CCGCGGAACG GATCGTGCTG ACCTCGGGGT CCGGCAGCCG GGTCCGCGAC ACCGACGGCC 6660 

151 GGGAGTACCT GGACGCGAGC GCCGTCCTCG GGGTGACCCA GGTGGGCCAC GGCCGGGCCG 6720 

152 AGCTGGCCCG GGTCGCGGCC GAGCAGATGG CCCGGCTGGA GTACTTCCAC ACCTGGGGGA 6780 

153 CGATCAGCAA CGACCGGGCG GTGGAGCTGG CGGCACGGCT GGTGGGGCTG AGCCCGGAGC 6840 

154 CGCTGACCCG CGTCTACTTC ACCAGCGGCG GGGCCGAGGG CAACGAGATC GCCCTGCGGA 6900 

155 TGGCCCGGCT CTACCACCAC CGGCGCGGGG AGTCCGCCCG TACCTGGATA CTCTCCCGCC 6960 

156 GGTCGGCCTA CCACGGCGTC GGATACGGCA GCGGCGGCGT CACCGGCTTC CCCGCCTACC 7020 

157 ACCAGGGCTT CGGCCCCTCC CTCCCGGACG TCGACTTCCT GACCCCGCCG CAGCCCTACC 7080 

158 GCCGGGAGCT GTTCGCCGGT TCCGACGTCA CCGACTTCTG CCTCGCCGAA CTGCGCGAGA 7140 

159 CCATCGACCG GATCGGCCCG GAGCGGATCG CGGCGATGAT CGGCGAGCCG ATC 7193 

161 (2) INFORMATION FOR SEQ ID NO: 2: 

162 (i) SEQUENCE CHARACTERISTICS: 

163 (A) LENGTH: 145 base pairs 

164 (B) TYPE: nucleic acid 

165 (C) STRANDEDNESS : single 

166 (D) TOPOLOGY: linear 

167 (ii) MOLECULE TYPE: Other 

168 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

169 GTGACCCGGC CTCCGGGCCT TTCCGCGCAC ACCCACGGGT CCGTGTCCGG GAGTCTGCTG 60 

170 CGCCGGGTGG CGGGCCACTA TCCCACCGGG GTGGTCCTGG TCACCGGTCC GGCCGAGGCT 120 

171 CCGGGGCAGC CGCCGCCCGC CATGG 145 

173 (2) INFORMATION FOR SEQ ID NO: 3: 

174 (i) SEQUENCE CHARACTERISTICS: 

175 (A) LENGTH: 453 base pairs 

176 (B) TYPE: nucleic acid 

177 (C) STRANDEDNESS: single 

178 (D) TOPOLOGY: linear 

179 (ii) MOLECULE TYPE: Other 

180 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

181 ATGTCCGTGG CATCGGCCGG TATGACGGAC GAGCAGCGCA AGGCGGTCAT CACCGCGTAC 60 

182 TTCAAGGCGT TCGACAACGG CGGCGTCGGC AGCGACGGCA CCCCCGCGAT CGACTACTTC 120 

183 GCCGAGGACG CGGTCTTCTT CTTCCCCAAG TGGGGTCTGG CCCGGGGCAA GTCCGAGATC 180 

184 GCCCGGCTCT TCGACGACCT CGGGGGCACC ATCCGCTCGA TCACCCACCA TCTGTGGTCC 240 

185 GTCAACTGGA TTCTGACCGG GACCGAACTC CTCGCCGCGG AGGGCACCAC CCACGGTGAG 300 

186 CACCGGGACG GGCCGTGGCG GGCGGGTGAC CCCGAGTGGG CCGCCGGGCG CTGGTGCACG 360 

187 GTCTACGAGG TGCGGGACTT CCTCGTCCAC CGGGCCTTCG TCTATCTGGA CCCCGATTAC 420 

188 GCGGGCAAGG ACACCGCGCG TTACCCGTGG CTG 453 

190 (2) INFORMATION FOR SEQ ID NO: 4: 

191 (i) SEQUENCE CHARACTERISTICS: 

192 (A) LENGTH: 1032 base pairs 
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193 (B) TYPE: nucleic acid 

194 (C) STRANDEDNESS : single 

195 (D) TOPOLOGY: linear 

196 (ii) MOLECULE TYPE: Other 

197 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

198 ATGTCCCGCT CTCCGCCCGA GTCCCCGGCC GGTTCCGTGT CCGCCGCGGT TCCGCGTCCG 60 

199 CCGGTCCGCG CCCTGCGGGA CCTTCCGGTC AGTGCCCAGG GGCTCGGCTG CCTGCCGACC 120 

200 ACCGACTTCT ACGGACGCCC GGACCGCGCC CGGGCGACGG CCACCATCCG CGCCGCCGTC 180 

201 GACGCCGGGG TCACCCTGCT GGACACCGCC GACGTCCAGG GGCTCGGCGC CGGTGAGGAG 240 

202 CTGCTCGGAC GGGCGGTCGC GGGCCGCCGG GACGAGGTGC TGATCGCCAC CAAGTTCGGC 300 

203 ATGGTGCGCT CGTCCGACGG CGCCTCCCAG GGCTTGTGCG GCGAGCCGTC CTACGTCCGC 360 

204 GCGGCCTGCG AACGGTCCCT GCGTCGTCTC GGCACCGACC GCATCGACCT GTACTACCAG 420 

205 CACTGGACGG ACCCGGCGGT GCCGATCGAG GAGACCGTGG GTGCGGTGGC CGAGCTGGTG 480 

206 CGCGAGGGCA AGGTCCGCAG GCTCGGTCTC TCCGAGCCCT CCGCGGCCAC GCTGCGCCGG 540 

207 GCGGACGCGG TGCACCCGGT GACGGCGGTG CAGAGCGAGT GGAGCCTGTG GTCGCGCGGG 600 

208 ATCGAGGACG AGGTGGTGCC CGTCTGCCGG GAGCTGGGGA TCGGGATCGT CGCTTACGCC 660 

209 CCTCTGGGAC GGGGTTTTCT CACCGGCACC ATCCGCACCA CCGACGATCT GGGGGACGAG 720 

210 GACTTCCGCC GGGGCCAGCC CCGGTTCAGC GCTCCGGCCC TCGCGCGCAA CCGCTCGTTG 780 

211 CTGCACCGGC TGCGCCCGGT CGCGGACGGT CTGGGGCTGA CCCTGGCACA GCTCGCGCTC 840 

212 GCCTGGCTGC ACCACCGGGG CGAGGACGTC GTCCCGATCC CGGGCACCGC GAACCCGGCC 900 

213 CATCTCGCGG ACAATCTCGC CGCCGCCTCG ATCCGGCTGG ACGACCGGTC CCTCGCGGAG 960 

214 GTGACGGCCG CGATCTCCCA CCCGGTGTCC GGGGAGCGGT ACACCCCGGC ATTGCTCGCC 1020 

215 ATGATCGGCA AC 1032 

217 (2) INFORMATION FOR SEQ ID NO: 5: 

218 (i) SEQUENCE CHARACTERISTICS: 

219 (A) LENGTH: 984 base pairs 

220 (B) TYPE: nucleic acid 

221 (C) STRANDEDNESS: single 

222 (D) TOPOLOGY: linear 

223 (ii) MOLECULE TYPE: Other 

224 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

225 GTGGAATGCC GCATATTCGA GATCGACGAA CTGCCGTTGC TGGACGGGGA GGTCCTGCGG 60 

226 GACGCCCGGA TCGGTTACGC CATGTACGGC ACGCCGAACG CCGACGGGAC GAACGTGGTG 120 

227 CTCTGTCCGT CGTTCTTCGG CCGGGACCAC ACCGGGTACG ACTGGCTGAT CGGTGCGGGG 180 

228 CTGCCGCTGG ACACCCGGCG GTACTGCGTC GTChCCGCCG GACTCTTCGG CAACGGGGTC 240 

229 TCCAGCTCGC CCGGCAACCA CCCGTCGGGG TCCCGCTTTC CGCTGATCAC TCCGCAGGAC 300 

230 AATGTCGCGG CGCAGCACCG GCTGCTCACC GAGGAGCTGG GGGTACGGGA ACTGGCCCTG 360 

231 GTCACGGGCT GGTCGATGGG CGCGGCCCAC GCCTACCAGT GGGCCGTGTC GCATCCGGGG 420 

232 ATGGTGCGCC GGATCGCCCC GATCTGCGGG GCGCCGGTGA GCAGCCCGCA CAGCCTGGTC 480 

233 CTGCTGTCCG GTCTGGCCGC GGCGCTCAGC GCCGACGCCG GGGAGCGGGG GCGGAAGGCG 540 

234 GCGGGCCGGG TGTTCGCCGG GTGGGGGACC TCGCGTTCCT TCTGGGCCCG CCGTGCCCAC 600 

235 CGGGAGCTGG GTTTCGCCAC CCGCGAGGAG TACCTCACCG GCTTCTGGGA GCAGGTCTTC 660 

236 CTCTCCGGGC CCGGCGCCGC GGATCTGCTC ACCATGGTGC GCACCTGGGA GAACACGGAT 720 

237 GTGGGGGCGA CACCCGGGGC CGGGGGGAGC GTCGAGGCGG CGCTGGCCTC CGTCACGGCG 780 

238 CGGGCCGTGG TGCTGCCGGG CGCCCTGGAC GTGTGTTTCG CCGTCGAGGA CGAGAAGCGG 840 

239 GTGGCCGATC TGCTGCCGTA TGCCTCGCTG GAGGTGATCC CGGGAGTGTG GGGGCATCTC 900 

240 GCGGGGTCCG GGGGGTCGGC CGCCGACCGG GAGTTCATCG GGGGCGCGCT GCGGCGGCTG 960 

241 CTGGACAGCC CGGTGGACGG GGGC 984 
243 (2) INFORMATION FOR SEQ ID NO: 6: 
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D08S 

SEQUENCE LISTING 

1 (1) GENERAL INFORMATION: 

2 (i) APPLICANT: SmithKline Beecham pic et al 

3 (ii) TITLE OF INVENTION: Novel compounds 

4 (iii) NUMBER OF SEQUENCES: 19 

5 (iv) CORRESPONDENCE ADDRESS: 

6 (A) ADDRESSEE: SmithKline Beecham 

7 (B) STREET: Two, New Horizons Court, Great West Road 

8 (C) CITY: Brentford 

9 (D) STATE: 

10 (E) COUNTRY: UK 

11 (F) ZIP: TW8 9EP 

12 (v) COMPUTER READABLE FORM: 

13 (A) MEDIUM TYPE: Diskette 

14 (B) COMPUTER: IBM Compatible 

15 (C) OPERATING SYSTEM: DOS 

16 (D) SOFTWARE: FastSEQ for Windows Version 2.0 

17 (vi) CURRENT APPLICATION DATA: 
C--> 18 (A) APPLICATION NUMBER: US/10/071 f 338 
C--> 19 (B) FILING DATE: 08-Feb-2002 

20 (C) CLASSIFICATION: 

21 (vii) PRIOR APPLICATION DATA: 

22 (A) APPLICATION NUMBER: 

23 (B) FILING DATE: 

24 (viii) ATTORNEY/AGENT INFORMATION: 

25 (A) NAME: Valentine, Jill B 

26 (B) REGISTRATION NUMBER: 

27 (C) REFERENCE/DOCKET NUMBER: P31731 

28 (ix) TELECOMMUNICATION INFORMATION: 

29 (A) TELEPHONE: 0181-9752000 

30 (B) TELEFAX: 0181-9756294 

31 (C) TELEX: 



ERRORED SEQUENCES 

602 (2) INFORMATION FOR SEQ ID NO: 19: 



603 (i) SEQUENCE CHARACTERISTICS: 

604 (A) LENGTH: 324 amino acids 

605 <B) TYPE: amino acid 

606 (C) STRANDEDNESS : single 

607 (D) TOPOLOGY: linear 

608 (ii) MOLECULE TYPE: protein 
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DATE: 
TIME: 



609 

610 

611 

612 

613 

614 

615 

616 

617 

618 

619 

620 

621 

622 

623 

624 

625 

626 

627 

628 

629 

630 

631 

632 

633 

634 

635 

636 

637 

638 

639 

640 

641 

642 

643 

644 

645 

646 

647 

648 

649 

650 

651 



RAW SEQUENCE LISTING 
PATENT APPLICATION: US/10/071 f 338 

input Set : N:\Crf3\02272002\J071338.raw 
Output Set: N:\CRF3\03192002\J071338.raw 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 
( lit Thr Ser Val Asp Cys Thr Ala Tyr Gly Pro Glu Leu 
1 5 10 

Ala Ala Arg Leu Pro Arg Thr Pro Arg Ala Asp Leu Tyr 

Asp Ala Ala His Thr Ala Ala Ala Ser Leu Pro Gly Ala 

35 ^ 
Ala Leu Asp Thr Phe Asn Ala Glu Gly Ser Glu Asp Gly 

50 55 
Leu Arg Gly Leu Pro Val Glu Ala Asp Ala Asp Leu Pro 

65 70 7 u 

Ser Ser Thr Pro Ala Pro Glu Asp Arg Ser Leu Leu Thr 

85 90 . mv 

Met Leu Gly Leu Val Gly Arg Arg Leu Gly Leu His Thr 

100 105 
Glu Leu Arg Ser Gly Thr Val Tyr His Asp Val Tyr Pro 

115 120 
Ala His His Leu Ser Ser Glu Thr Ser Glu Thr Leu Leu 

130 135 
Thr Glu Met Ala Tyr His Arg Leu Gin Pro Asn Tyr Val 

145 I 50 15 

Cys Ser Arg Ala Asp His Glu Arg Thr Ala Ala Thr Leu 

Val Arg Lys Ala Leu Pro Leu Leu Asp Glu Arg Thr Arg 

180 185 
Leu Asp Arg Arg Met Pro Cys Cys Val Asp Val Ala Phe 

195 200 
Val Asp Asp Pro Gly Ala lie Ala Gin Val Lys Pro Leu 

210 21 -> 
Ala Asp Asp Pro Phe Leu Gly Tyr Asp Arg Glu Leu Leu 
225 230 235 

Asp Pro Ala Asp Lys Glu Ala Val Ala Ala Leu Ser Lys 

245 250 
Glu Val Thr Glu Ala Val Tyr Leu Glu Pro Gly Asp Leu 

265 

Asp Asn Phe III Thr Thr His Ala Arg Thr Pro Phe Ser 

275 2Q0 z 

Asp Gly Lys Asp Arg Trp Leu His Arg Val Tyr lie Arg 

290 295 
Asn Gly Gin Leu Ser Gly Gly Glu Arg Ala Gly Asp Val 
305 310 315 

jrhr Pr o_A rg Gly . 

attorney Docket : P31731 
' Group Art Unit: 1652 
? cket : P31731 

Group Art Unit: 1652 



03/19/2002 
17:52:50 



Arg Ala Leu 
15 

Ala Phe Leu 
30 

Leu Ala Thr 

His Leu Leu 

Thr Thr Pro 
80 

Met Glu Ala 
95 

Gly Tyr Arg 
110 

Ser Pro Gly 

Glu Phe His 

Met Leu Ala 
160 

Val Ala Ser 

175 
Ala Arg Leu 
190 

Arg Gly Gly 

Tyr Gly Asp 

Ala Pro Glu 
240 

Ala Leu Asp 
255 

Leu He Val 
270 

Pro Arg Trp 

Thr Asp Arg 

Val Ala Phe 
320 



Mite 
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VERIFICATION SUMMARY DATE: 03/19/2002 

PATENT APPLICATION: US/10/071,338 TIME: 17:52.51 

input Set : N:\Crf3\02272002\J071338.raw 
Output Set: N:\CRF3\03192002\J071338.raw 

L:18 M:220 C: Keyword misspelled or invalid format, [(A, f™™™™™***^ 
L-19 M:220 C: Keyword misspelled or invalid format, [ B) FILING DATE . ] 
L-653 M-332 E: (32) Invalid/Missing Amino Acid Numbering, SEQ ID.O 
L-653 M-333 E: Wrong sequence grouping, Amino acids not in groups. 
1^653 M:330 E: (2) Invalid Amino Acid Designator, 3 

L:653 M:203 E: No. of Seq. differs, LENGTH : Input : 324 Found: 327 SEQ. 19 
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